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(54) System and method for correcting defects in captured images 



(57) A captured image is processed with one or 
more correction processes selected from a plurality of 
such processes, each associated with correction of a 
specific type of image defect, in order to improve the 
appearance of a viewed image generated from the cap- 
tured image. Preliminary to the image processing, meta 
data related to image capture is obtained that is unique 
to each captured image, where the meta data is capable 
of indicating whether the specific types of image defects 
are likely to be present in the viewed image generated 
from the captured image. The processing then involves 
predicting the presence of the image defects based at 
least in part on the meta data, thereby generating proc- 
ess application criteria which indicate a level of image 
defect that if left untreated would reduce the perceived 
quality of the viewed image; selecting one or more cor- 
rection processes to employ on the captured image 
based on the process application criteria; and applying 
the one or more selected correction processes to the 
captured image to generate the viewed image. 
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Description 



FIELD OF THE INVENTION 

5 [0001] This invention relates generally to the field of digital image processing, and, in particular, to the prediction and 
correction of image defects in a photograph. 

BACKGROUND OF THE INVENTION 

10 [0002] Photographic systems produce a wide range of image quality when operated by amateur, often referred to 
as "point-and-shoot", photographers. If the photographic environment for a given scene is well suited to the image 
capture system (e.g. subjects are stationary and within the focus range, ambient light level is uniform and of sufficient 
intensity, and lens magnification is appropriate for the subject matter), good results are typically obtained. However, 
when these conditions are not present, image defects may be introduced due to failures in the capture or reproduction 

15 system, thereby reducing the quality of the final viewed image. To minimize the effects of suboptimal image capture 
conditions, camera designers have attempted to compensate by adding features intended to expand the range of light 
levels and' distances where images can be captured. Unfortunately, these features often solve the primary problem, 
but add a secondary, sometimes severe, Image defect. 

[0003] For example, if the intensity of the ambient light is insufficient to provide adequate exposure, and the primary 
20 subject is located less than 20 feet from the camera, most built-in electronic flash units are able to provide auxiliary 
illumination sufficient to at least partially expose the primary subject. However, even if the primary subject now receives 
adequate illumination, the flash may introduce image defects. 

[0004] As is well known in the art, the image defect known as redeye may occur when the angle between a narrow 
light source, the photographic subject, and the camera lens is less than approximately three degrees. This criterion is 
25 frequently met in flash exposures from compact cameras. The light from the flash enters the pupil nearly on-axis and 
propagates to the fundus of the eye, where it is reflected back out of the eye, having been colored red by the blood 
vessels in the fundus. The light exits the eye in a narrow cone, and if the camera lens falls within that cone,. the red 
reflection will be recorded, and may appear in the final image as a red glow in the pupils, which is very undesirable in 
terms of image quality. 

30 [0005] Redeye is more objectionable when the size of the pupil in the viewed image is larger and when the red 
saturation of the pupil is greater. The former may occur when the pupil is dilated, as occurs at low ambient light levels, 
or when the subject is rendered at a larger size in the image, for example due to shorter camera to subject distance, 
longer camera lens focal length, higher printing magnification (including zoom and crop), and/or shorter viewing dis- 
tance. The primary techniques used in the camera to reduce or eliminate redeye are: increasing flash to lens separation; 

35 firing a preflash to transiently stop down the pupil in response to the bright light; and decreasing lens focal length and/ 
or electronic zoom. 

[0006] While all these methods are efficacious, all have associated disadvantages. Increased flash-lens separation 
may lead to more expensive and bulkier cameras and produces more noticeable shadows due to the farther off-axis 
lighting. After a preflash is fired, the eye requires half a second or more to respond fully, and during this delay between 

40 the preflash fire and the image capture, facial expressions of the subject often change in an undesirable fashion due 
to the annoyance and surprise of the preflash. The preflash also increases camera cost reduces the power available 
during the main flash pulse, and increases battery consumption. Finally, restriction of optical or electronic zoom factors 
interferes with the photographer's ability to obtain the desired composition, with the subjects appearing large enough 
in the image to provide a pleasing rendition. 

45 [0007] Given the disadvantages of the in-camera redeye reduction techniques summarized above, and the increased 
availability of digital printing devices capable of making corrections to selected portions of individual images, consid- 
erable effort has been directed towards the development of techniques for locating and correcting the redeye defect 
during the photofinishing (digital printing) process. 

[0008] U.S. Patent Number 5,748,764 issued 5 May 1998 teaches a method of locating and correcting the redeye 
50 image defect in an image. U.S. Patent Number 5,892,837 issued 6 April 1999, and related commonly assigned US. 
Patent Number 6,292,574 issued September 1 8, 2001 and U.S. Patent Number 6,151 ,403 issued November 21 , 2000 
generally describe additional methods suitable for locating human eyes in an image, and specifically describe locating 
and correcting the appearance of human eyes with the redeye image defect. These digital redeye removal techniques, 
while effective, are computationally intensive, and therefore increase the time required to optimally render and repro- 
55 duce copies of captured images. The time required to perform these operations may in some cases be the rate limiting 
step in automated high speed printing operations. If the redeye defect location and correction processes are applied 
to every image in a customer order, even though only a portion of those images actually contain the defect, productivity 
and profit may be reduced. In addition, if computational time is spent searching for redeye defects in every image, 
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other beneficial image processing operations such as tone scale mapping, digital noise reduction and sharpening may 
not be possible in the time interval allocated for each image. It is therefore desirable to be able to predict when redeye 
will occur, and to invoke the redeye location and correction processes only when needed. 

[0009] From an extensive study to determine whether it is possible to predict from data collected at the time the 
5 original scene is photographed, the probability and severity of the redeye defect that will be present in the final image, 
it was discovered that the extent of the redeye defect depends primarily on the following factors: subject race, subject 
age, prefiash illumination level, flash-to-lens separation, camera to subject distance, ambient light level, camera lens 
focal length, reproduction magnification, and final image viewing distance. In the present invention these factors are 
used to predict, on an image by image basis, the severity of the redeye defect, and that information is transferred from 
w the camera to the photof inishing system where it commands the automatic printer control system, or in the case of 
human assisted printers, alerts the operator, to apply redeye defect location and correction techniques only when 
warranted, thereby improving picture quality and enhancing photofinishing productivity. 

[0010] In addition to the redeye image defect, it is well-known that the physics of light intensity loss as a function of 
distance from a narrow source, such as an electronic flash tube, often leads to a defect in lighting contrast and con- 
scqucntiy distorted tone reproduction in the final viewed image. Specifically, with every doubling of camera-to-subject 
distance tnc hghi intensity per unit area on the subject drops by a factor of four. For example, if the primary subject is 
locHted 6 tcci from the camera and the background is located 12 feet from the camera, the captured image of the 
Dnc^qrojnonrtL nn exposure level only onequarterthat of the image of the primary subject. This causes the background 
i . m u c» ^ f fcer than the primary subject does in the final viewed image. Because light falls off according to this 

c-j !j.-k:«w- f espect to distance, the exposure difference between the primary subject and the background 
. irr \* .frustrated above, particularly when images are captured outdoors at night or in large rooms. When 

p" >c^r : m ~- .~>*ge having a large exposure range (high contrast scene) with no knowledge of which portion of 
m ^ r*v iviT-.i'v subject, the exposure control system in the printer often calculates an average or area-weighted 
c»pot.*i* •* f » c * c cssively lighten the primary subject. This defect is particularly detrimental in pictures of people, 

25 whow vcl -ft m .^tned out and lack proper flesh reproduction.^ 

[0011] i' ^>.'v o* nigh contrast scenes universally contained overexposed primary subjects and underexposed 
b*c*Toj'vn »«,jst rated above, it would be practical to introduce a darken bias when printing all high contrast 
sronr**. i in»r—..~.„!n#y there is a class of scenes known as backlight that are high in contrast, but have a subject-to- 
b*c*rrc;.'nr! nrw.jro ratio that is opposite that of flash scenes. In the case of backlight scenes the illumination source 

30 is often De^mj inc primary subject, or the primary subject is shaded by another object, such as a tree or a building, 
and nereico eceives only a fraction of the ambient illumination. Consequently, the primary subject is underexposed 
relative :c tic background. In this case if the darkening bias needed to correct harsh flash scenes was applied, the 
already dh^k primary subject would be rendered even darker, having the effect of further reducing the image quality. 
[0012] The information exchange (Ix) feature of the Advanced Photo System offered by Eastman Kodak Company 

35 may mn*c jsc oi information collected at the time of image capture and passed to the printer to indicate whether, for 
the currcn! image the electronic flash was employed. (The Advanced Photo System specifications documents can be 
found at http /www kodak.com/global/en/consumer/APS/redBook/specslndex. shtml.) If the flash was fired, and a high 
contrast scene is inferred from the scanned image densities, a darkening bias can be applied to the image during 
printing This information helps discriminate between backlight and harsh flash shots, and increases the probability 

to that the primary subject will be printed to the proper lightness. However, because in both backlight and harsh flash 
scenes the dynamic range of the scene may exceed the tonal range of the print material, the primary subject and 
background can not be simultaneously rendered properly by invoking a full-image-field darken (in the case of harsh 
flash) or lighten (in the case of backlight) printing correction. This means that optical (analog) printing systems, which 
are only capable of producing full-field exposure corrections, can not produce optimal renditions of high contrast scenes. 

45 [0013] Recent advances in digital image processing make practical methods for digitally segmenting the image field, 
analyzing tnc dynamic range, and adjusting tone reproduction (lightening or darkening) on an image area specific 
basis. By remapping the tone reproduction in this fashion, both the overexposed and underexposed portions of high 
contrast scenes can be rendered within the tonal range of the print material, thereby making the information in both 
regions visible in the final image. These digital area-specific tone scale remapping techniques, while effective, are 

so computationally intensive, and therefore increase the time required to optimally render and reproduce copies of cap- 
tured images. The time required to perform these operations may in some cases be the rate limiting step in automated 
high speed printing operations. If the tone scale remapping techniques are applied to every image in a customer order, 
even though only a portion of those images actually contain the defect, productivity and profit may be reduced. In 
addition, if computational time is spent searching for tone scale defects in every image, other beneficial image process- 

55 ing operations such as redeye location and correction, digital noise reduction and sharpening may not be possible in 
the time interval allocated for processing each image. It is therefore desirable to be able to predict when tone scale 
defects will be present., and to invoke tone scale remapping processes only when needed. 

[0014] From a study to determine whether it is possible to predict from data collected at the time the original scene 



3 



is photographed, the probability and severity of the tone scale defect that will be present in the final image, it was 
discovered that the extent of the tone scale defect depends primarily on the following factors: flash state (full.fill.off), 
primary subject light level, background light level, primary subject distance, background distance, and if available, state 
of the manual or automatic camera backlight exposure compensation control. In the present invention, these factors 
5 are used to predict, on an image by image basis, the severity of the tone scale defect, and that information is transferred 
from the camera to the photofinishing system where it commands the automatic printer control system, or in the case 
of human assisted printers, alerts the operator, to apply tone scale defect location and correction techniques only when 
warranted, thereby improving picture quality and enhancing photofinishing productivity. 

[0015] If the ambient light level is not sufficient to provide adequate exposure, and the flash is deactivated or the 

10 primary subject is located beyond the maximum flash range, the image capture system will produce an underexposed 
image. In the case of film-based camera systems underexposure leads to latent image formation in primarily the most 
sensitive (fastest) layer, comprised of the largest silver halide grains. When processed and printed, images comprised 
of these fast, large grains permit a reproduction of the scene to be created, but the final viewed image typically contains 
noticeable grain structure, which masks fine detail and lowers the perceived image quality. The appearance of the 

15 grain, referred to more generally as image noise, becomes more objectionable when the reproduction magnification 
is increased, for example, in enlargements, pseudo-panoramic or pseudo-telephoto print formats. 
[0016J In the case of digital still cameras (DSCs) with, for example, CCD or CMOS image sensors, the photographic 
sensitivity (exposure index) of the sensor may be adjusted automatically or manually, by the photographer, in response 
to the scene light level, to attempt to maintain adequate tone reproduction. The photographic sensitivity is adjusted by 

20 changing the gain of the sensorsignal amplifier, taking into account the color temperature (white balance) of the ambient 
illuminant. When the ambient light level is high (bright scene), the amplifier gain is low, thereby producing a high 
(favorable) signal-to-noise ratio (SNR). When the ambient light level is low (dim scene), the amplifier gain is increased, 
which produces a low (unfavorable) SNR. When the gain is increased in this fashion, the tone reproduction of the 
image is improved relative to the standard amplifier gain; however, due to the low SNR, the final viewed image will 

25 typically contain noticeable noise, analogous to the grain in underexposed film images, which masks fine detail and 
lowers the perceived image quality. The appearance of noise defects becomes more objectionable when the repro- 
duction magnification is increased, for example, in enlargements, pseudo-panoramic or pseudo-telephoto (electronic 
200m) print formats. 

[0017] Techniques such as those exemplified in The Sigma filter, described by Jong-Sen Lee in the journal article 

30 Digital Image Smoothing and the Sigma Filter, Computer Vision, Graphics, and Image Processing Vol 24, p. 255-269, 
1983, are useful as noise reduction algorithms to enhance the visual appearance of the processed digital image. These 
digital area-specific noise reduction techniques, while effective, are computationally intensive, and therefore increase 
the time required to optimally render and reproduce copies of captured images. The time required to perform these 
operations may in some cases be the rate limiting step in automated high speed printing operations. If the digital noise 

35 reduction techniques are applied to every image in a customer order, even though only a portion of those images 
actually contain the defect, productivity and profit may be reduced. In addition, if computational time is spent searching 
for and correcting noise defects in every image, other beneficial image processing operations such as redeye location 
and correction, tone scale remapping and sharpening may not be possible in the time interval allocated for processing 
each image. It is therefore desirable to be able to predict when noise defects will be present, and to invoke digital noise 

40 reduction processes only when needed, 

[0018] From an extensive study to determine whether it is possible to predict from data collected at the time the 
original scene is photographed, the probability and severity of the noise defect that will be present in the final image, 
it was discovered that the extent of the noise defect depends primarily on the following factors: Reproduction magni- 
fication; final image viewing distance; baseline exposure index film noise, or, in the case of DSCs, baseline exposure 

45 index sensor noise and the state of the manual or automatic DSC (R,G,B) exposure index control, which determines 
the sensor amplifier gain level; and the film or the sensor exposure level. In the present invention these factors are 
used to predict, on an image by image basis, the severity of the noise defect, and that information is transferred from 
the camera to the photofinishing system where it commands the automatic printer control system, or in the case of 
human assisted printers, alerts the operator, to apply noise defect location and correction techniques only when war- 

50 ranted, thereby improving picture quality and enhancing photofinishing productivity. 

[0019] Even if the photographic environment provides ambient light that is uniform and of sufficient intensity to provide 
an exposure level that obviates the need for electronic flash or high-noise ambient captures, and the primary subject 
is within the focus range of the camera, the camera lens magnification provided by the normal lens (often defined as 
the diagonal dimension of the image capture frame) may be insufficient to capture an image of the primary subject that 

55 js the preferred size in the final viewed image. The size of the primary subject in the final viewed image is proportional 
to a quantity known as the angular magnification (AM) of the system, which can be characterized by the following 
equation: 
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AM = [(F1)(Mr)]/Vd _ _. 

Where: 

5 

Fl = camera lens focal length (specified in inches) 

Mr = reproduction magnification (ratio of image to display size) 

Vd = final image viewing distance (specified in inches) 

10 [0020] The eye-to-display separation (viewing distance) has been found to vary with respect to final display size 
according to the following formula disclosed by the present inventors, in columns 43-44 of commonly-assigned U.S. 
Patent Number 5,323,204: 

Vd = 3.64+1 1.34[log 10 (D)] 

Where: 

D = the diagonal dimension of the final display (specified in inches) 

20 

[0021] The most common method for increasing the AM involves the inclusion of telephoto or variable (zoom) focal 
length image capture optics on the camera. This approach produces larger subjects in the final viewed image by 
increasing the image capture magnification and maintaining a standard (full frame) printing magnification. Other meth- 
ods involving pseudo-telephoto optical printing or electronic zoom digital printing are well known in the art. These 

25 techniques produce larger subjects in the final viewed image by increasing the printing magnification and cropping out 
a portion of the image frame, while retaining the standard print size (e.g/4"x 6 inch) and the standard image capture 
lens focal length. Finally, by simply producing a larger (e.g. 8x10 inch) final image size, the reproduction magnification 
is increased, and therefore the AM and perceived subject size, even after including the longer final viewing distance, 
are also larger. The increase in AM provided by the aforementioned techniques may lead to a more pleasing compo- 

20 sition; however, it also magnifies image blur resulting from inadequate lens depth-of-f ield, subject motion, and photog- 
rapher hand tremor. The magnified image blur causes sharpness defects to be visible in the final viewed image. 
[0022] Recent advances in digital image sharpness enhancement, as exemplified in U.S. Patent Number5,398,077, 
teach methods for digitally segmenting the image field, analyzing the content to separate signal and noise components, 
and boosting the sharpness on an image area specific basis. These digital area-specific sharpening techniques, while 

35 effective, are computationally intensive, and therefore increase the time required to optimally render and reproduce 
copies of captured images. The time required to perform these operations may in some cases be the rate limiting step 
in automated high speed printing operations. If the digital sharpening techniques are applied to every image in a cus- 
tomer order, even though only a portion of those images actually contain the defect, productivity and profit may be 
reduced. In addition, if computational time is spent searching for and correcting sharpness defects in every image, 

40 other beneficial image processing operations such as redeye location and correction, tone scale remapping and noise 
reduction may not be possible in the time interval allocated for processing each image. It is therefore desirable to be 
able to predict when sharpness defects will be present, and to invoke digital sharpening processes only when needed. 
[0023] From an extensive study to determine whether it is possible to predict from data collected at the time the 
original scene is photographed, the probability and severity of the sharpness defect that will be present in the final 

45 image, it was discovered that the extent of the sharpness defect depends primarily on the following factors: reproduction 
magnification, final image viewing distance, camera lens focal length, DSC resolution or camera film speed, shutter 
time, subject motion, photographer hand tremor, and subject distance if outside of focus range. In the present invention, 
these factors are used to predict, on an image by image basis, the severity of the sharpness defect, and that information 
is transferred from the camera to the photofinishing system where it commands the automatic printer control system, 

50 or in the case of human assisted printers, alerts the operator, to apply sharpness defect location and correction tech- 
niques only when warranted, thereby improving picture quality and enhancing photofinishing productivity. 
[0024] One proposal for optimizing multiple image processing operations is described in U.S. Patent Number 
5,6g4,484 ; issued Dec. 2, 1997, to Cottrell et al. Cottrell et al. disclose an image processing system that proposes to 
optimize the perceptual quality of images undergoing a series of image-processing operations selected by an operator. 

55 The system consists of a set of selected image-processing operations, an architecture, and a control system. These 
elements take into consideration profiles of source characteristics from which the images are generated, profiles of 
output device characteristics, and the impact that image processing operations (individually or in concert) will have on 
perceived image quality Control parameters for the individual image processing operations are modified by optimizing 
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an image quality metric (a single numerical quality) based on mathematical formulas relating objective metrics (such 
as sharpness, grain, tone, and color) with perceived image quality. In the method described by Cottrell et al., the values 
tor the individual control parameters are varied over useful ranges until the image quality metric achieves an optimal 
value. Besides involving significant computation resources to evaluate the multitude of parameter permutations, this 
method requires operator intervention to select the set of image processing operations that will be applied in each case 
[0025] In U.S. Patent Number 5,835,627, issued Nov. 10, 1998 to Higgins, Hultgren and Cottrell, the process de- 
scribed above in the '484 patent is refined with the addition of an algorithm selector that tries each possible sequence 
of image processing operations and a customer satisfaction index (CSI), which proposes to balance the perceived 
image quality and the image processing time, as exhibited by the different image processing sequences. As was the 
case in U.S. Patent Number 5,694,484, the image quality estimate is based on device profiles that are constant value 
inputs for each image source and downstream device, and that are typically generated during calibration of the indi- 
vidual devices in a factory or laboratory setting (see U.S. 5,835,627, col. 3, line 60-65). Besides involving significant 
computation resources to evaluate the multitude of parameter permutations as in the '484 patent, this refinement in- 
creases the amount of computation by causing the process to iterate through each new sequence until an optimal CSI 
is obtained. 

[0026] Despite the elaborate methodology disclosed in U.S. Patent Numbers 5,694,484 and 5,835,627, such systems 
fail to recognize the importance and use of capture-specific data, that is : variable data collected at the time of Image 
capture, to predict on an image by image basis the best selection of image defect correction algorithms to apply, m 
particular, it would be desirable to make advantageous use of scene- and exposure-specific data to predict the best 
selection of image defect correction algorithms to apply. 

SUMMARY OF THE INVENTION 

[0027] It is an object of the present invention to improve the quality and efficiency of digital printing by applying image 
defect location and correction processes only when the current image is predicted to have a level of image defect that 
if left untreated would reduce the perceived quality of the final viewed image. 

[0028] It is a further object to make use of camera, scene, and demographic data collected at the time of image 
capture to predict on an image by image basis the best selection of image defect correction algorithms to apply. 
[0029] The present invention is directed to overcoming one or more of the problems set forth above. Briefly summa- 
rized, according to one aspect of the present invention, the invention resides in a system and method for processing 
a captured image with one or more correction processes selected from a plurality of such processes, each associated 
with correction of a specific type of image defect, in order to improve the appearance of a viewed image generated 
from the captured image. The inventive method includes the steps of (a) collecting meta data related to image capture 
that is unique to each image that is captured, wherein the meta data is capable of indicating whether the specific types 
of image defects are likely to be present in the viewed image generated from the captured image; (b) predicting the 
presence of the image defects based at least in part on the meta data, thereby generating process application criteria 
which indicate a level of image defect that if left untreated would reduce the perceived quality of the viewed image; (c) 
selecting one or more correction processes to employ on the image based on the process application criteria; and (d) 
applying the one or more selected correction processes to the image to generate the viewed image. 
[0030] The meta data, which may be collected at the time of image capture or at some other time to the extent 
possible, such as at a photofinishing kiosk, includes scene, camera or demographic data specifically related on an 
image-by-image basis to the image capture. Moreover, the step of predicting the presence of the image defects may 
also predict the severity of the defects and the strength of the corresponding correction process can be altered in 
response to the degree of severity. In addition, the collection of meta data may further include the collection of display 
parameters of the viewed image generated from each image that is captured, wherein such display parameter meta 
data is also capable of indicating whether the different types of image defects are likely to be present in the viewed 
image. 

Advantageous Effect of the Invention 

[0031] The present invention is advantageous in that it improves the quality and efficiency of digital photofinishing 
processes, whether fully automated or operator assisted. Specifically, the advantage is realized by applying image 
defect location and correction processes only when needed forthe current scene, thereby eliminating time consuming 
operations that would fail to substantially improve the quality of the current image. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
[0032] 

5 FIG. 1 is a block diagram of a digital image reproduction system according to the invention; 

FIG. 2 is a functional block diagram of an image processing system illustrating the image analysis, image defect 

prediction, image processing (defect correction), and image output steps of the present invention; 

FIG. 3 is a logic diagram illustrating one technique of using data captured at the time of image capture to decide 

when to apply noise defect location and correction processes in digital image reproduction apparatus; 
10 FIG. 4 is a logic diagram illustrating one technique of using data captured at the time of image capture to decide 

when to apply redeye defect location and correction processes in digital image reproduction apparatus; 

FIG. 5 is a logic diagram illustrating one technique of using data captured at the time of image capture to decide 

when to apply tonescale defect location and correction processes in digital image reproduction apparatus; 

FIG. 6 is a logic diagram illustrating one technique of using data captured at the time of image capture to decide 
15 when to apply sharpness defect location and correction processes in digital image reproduction apparatus. 

FIG. 7 is a diagram illustrating a film camera adapted to write meta data on a photographic film. 

FIG. 8 is a diagram Illustrating a digital camera adapted to write meta data on a digital record. 

DETAILED DESCRIPTION OF THE INVENTION 

20 

[0033] Because image processing systems employing defect recognition and correction are well known, the present 
description will be directed in particular to attributes forming part of, or cooperating more directly with, system and 
method in accordance with the present invention. System and method attributes not specifically shown or described 
herein may be selected from those known in the art. In the following description, a preferred embodiment of the present 

25 invention would ordinarily be implemented as a software program, although those skilled in the art will readily recognize 
that the equivalent of such software may also be donstructed in hardware. Given the system as described according 
to the invention in the following materials, software not specifically shown, suggested or described herein that is useful 
for implementation of the invention is conventional and within the ordinary skill in such arts. If the invention is imple- 
mented as a computer program, the program may be stored in conventional computer readable storage medium, which 

30 may comprise, for example; magnetic storage media such as a magnetic disk (such as a floppy disk or a hard drive) 
or magnetic tape; optical storage media such as an optical disc, optical tape, or machine readable bar code; solid state 
electronic storage devices such as random access memory (RAM) : or read only memory (ROM); or any other physical 
device or medium employed to store a computer program. 

[0034] Prior to providing a detailed description of each embodiment of the present invention, the methods used to 

35 set the image quality switch-points for activating the image defect location and correction processes will be discussed. 
Because it is the purpose of this invention to predict from data collected at the time of image capture the presence and 
severity of image defects, to determine which correction processes to employ, each of the defects must be assessed 
with respect to the same image quality standard. To accomplish this goal, perceptually relevant image quality assess- 
ment and modeling techniques disclosed by the present inventors in the proceedings of The Society for Imaging Science 

40 and Technology (IS&T), Image Processing Image Quality Image Capture Systems (PICS 2000) Conference, ISBN: 
0-89298-227-5, are employed. Specifically, the papers entitled "Characterization and Prediction of Image Quality 1 by 
B.W. Keelan, and "Use of System Image Quality Models to Improve Product Design " by R. B. Wheeler, which are 
incorporated herein by reference, should be consulted for additional background information. The image quality scale 
is based on just-noticeable-difference (JND) units of quality. In experiments, a JND is defined as the smallest image 

45 quality difference that can be detected by 50% of observers in forced-choice (no ties allowed) paired image compari- 
sons. Stated another way, when two images are separated by exactly one JND, 50% of human observers will perceive 
the quality advantage of the better image and rate it best. The other 50% of the human observers will not perceive the 
difference, but will guess correctly half the time. Consequently, in a forced-choice comparison between two images 
that differ by exactly one JND, the higher quality image will be rated better 75% of the time. As the quality difference 

so between image pairs becomes larger there will be nearly universal agreement as to which one is better, as a result, a 
scale covering a wide range of quality is composed of many JNDs. For example, when cast in subjective quality terms 
such as excellent, very good, good, fair, poor, etc., we have found that a difference of about 6 JNDs constitutes a full 
quality category. 

[0035] While it is possible to set image quality switch-points for activating image defect location and correction proc- 
55 esses at different absolute quality levels without departing from the spirit of the invention, the preferred embodiment 
employs a three JND (one-half quality category) criteria. This provides a reasonable balance between image quality 
enhancement and photofinishing throughput (images processed per unit of time). If the image quality switch-point is 
set at a lower JND value (e.g. one JND of degradation), the image defect location and correction processes will be 
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invoked more frequently, which may lead to higher average image quality, but lower throughput due to extended imaqe 
process.ng time. If the quality switch-point is set at a higher JND value (e.g. 6 JNDs of degradation), the image defect 
location and correction processes will be invoked less frequently, which may lead to lower average image quality but 
higher throughput due to shortened image processing time. 

[0036] In the detailed description of our preferred embodiments, image defect correction switch-points are defined 
that either activate or deactivate the image defect location and con-ection processes. It is to be understood that while 
this description pertains to correction processes that are applied in the same manner every time the quality loss due 
to an image defect is equal to or greater than the switch-point (e.g. 3 JNDs), the strength of the image defect correction 
processes can be altered in response to the degree of image quality loss predicted for the current scene. 
[0037] For example, in the case of sharpness correction, as the predicted level of the image defect becomes worse 
the gain of the spatial filter applied in the sharpening operation can be increased. Similarly, in the case of noise defect 
correction, customized noise correction tables can be applied in response to the predicted magnitude of the image 
defect. In addition, the preferred embodiment of the present invention can be used to determine which image defect 
correction processes should be activated, and the strength of the conections employed in those activated processes 
can be determined by analyzing the image pixel data using a series of pixel data predictors that correlate with the 
degree of the degradation caused by the defect. For example, the gradient of edges can be used to estimate the 
strength of the spatial filtering needed to correct sharpness defects. 

[0038] As noted previously, the inventive process uses information collected at the time of image capture, which is 
referred to as meta data, to predict the presence of image defects in a photograph, and to subsequently decide when 
to apply image defect correction processes in digital photofinishing. A variety of meta data describing the scene and 
camera conditions used to capture the cunent image, can be recorded by the camera and transferredto photofinishing 
equipment, as is described in commonly assigned U.S. Patent Number 5,229,810, which is incorporated herein by 
reference. In the case of cameras employing silver halide based films containing an integral magnetic layer, commonly 
referred to as Advanced Photo System (APS) films, the information recording device preferably comprises a magnetic 
recording head for magnetically encoding data on the surface of the film. Alternatively, in the case of film cameras 
without magnetic writing capability, latent image barcode data may be exposed outside the area of the primary image 
frame, and later decoded in the photofinishing equipment. In the case of DSCs, other known data recording techniques 
may also be utilized such as optical or magnetic recording on separable media such as disks or integrated circuit cards 
In the detailed description of the embodiments that follows, it is understood that the method by which information is 
exchanged between the image capture device and the photofinishing equipment, while not specifically identified in 
each case, can be accomplished with any of the aforementioned well-known methods. 

[0039] In FIG. 1 , a general digital image processing system 10 useful in the practice of the invention is shown in 
which input pictorial image data and related image classification parameters are provided by means of one of a variety 
of indicated devices. The illustrated input devices include a photographic film scanner 12 which optically scans the 
•mage frames on a film strip and converts the scanned signals into digital image data. If the scanner is capable of 
reading APS film, then the scanner will typically read from the magnetic layer or optical bar code on the film information 
encoded in the APS meta data information exchange (IX) and manufacturer data areas. Such meta data may include 
film type, camera settings, scene conditions, intended print format, and other data fields. Other possible image input 
devices include a digital file reader 14 which may contain data from a variety of sources, including digital cameras or 
a picture disk reader, a network input (e.g. modem) 16 which receives digital file data from a remote, central source 
as in the case of Kodak Picture Network, or an order entry station input device 18 located at a retail store which scans 
a customers film, reads digital disks, and accepts order instructions, including print aspect ratio, size, zoom, crop and 
magnification instructions. This data is then inputto an image processing computers which may also include a display 
monitor 22 and a user data input device such as a keyboard 24. In the case of a home-based personal computer for 
example, the key board may be used to input some of the scene, camera, and output size related data mentioned 
above. Included in the image processing functions of the computer 40, in accordance with the present invention is a 
process that makes use of camera, scene, and demographic factors to predict the presence of image defects in a 
photograph and subsequently to applies image defect correction means only when needed. 

[0040] The output of the image processing computer 40 is applied to an appropriate output path for generation of 
hardcopy images. Representative output paths are illustrated and include a printer 26, such as a thermal dye printer 
or inkjet pnnter which are exemplary of printers useful for home computer use. Alternatively, the output path may 
comprise retail photofinisher equipment 28, such as a Noritsu 2711 Series Printer. Yet another exemplary output path 
comprises data communications device 30 which communicates with, for example, a remote commercial photofinishing 
laboratory 32 using a CRT or other photographic printer. Alternatively, the essential functions of image processing 
system 10, such as film scanning and conversion of scan data to digital image files, reading of digital camera image 
files, reading of information pertaining to camera, scene, and output parameters, and making use of camera scene 
and demographic data to predict the presence of image defects in a photograph, and subsequently applying image 
defect correction means only when needed, can be incorporated in an integrated apparatus such as a retail photofin- 



8 



XXJID: <EP 129651 0A2J_> 



EP1 296 510 A2 



isher unit. 

[0041] An important feature of the system shown in Figure 1 is the collection ot metadata at the time of image capture 
indicating, or capable of indicating, whether a defect is likely to be present in the final viewed image. In the preferred 
embodiments, this is accomplished with data collected by either a film camera or a digital camera, although some of 

5 the meta data (e.g., demographic data and display parameters) could be collected at the order entry station 1 8 or some 
other location subsequent (or prior) to the time of capture. In a typical film camera embodiment, as shown in Figure 7, 
a film camera 200 transports a film strip 201 between the reels 205a,b of a film cartridge and a take-up sprocket 
respectively. The camera 200 includes a magnetic read/write head 210 facing a magnetic layer on the unsensitized 
side of the film strip 201 . A microprocessor 215 controls magnetic data recording or playback by the head 21 0 through 

10 head electronics 220. 

[0042] The microprocessor 215 may accept meta data to be magnetically recorded on the film strip 100 from the 
camera user or the camera mechanisms themselves through camera controls 225, such information pertaining, for 
example, to the desired display parameters, lens parameters (e.g., focal length, F-number, camera lens focus range), 
shutter speed, autofocus distance measurements of subject and background, backlight and flash fire state indicators, 

15 and the like, for ultimate use by the photofinisher. If a suitable input device is provided, for example a keypad, demo- 
graphic data could be generated at this time. The microprocessor 21 5 may also accept scene related information from 
scene sensors 230 to be magnetically recorded on the film strip 100 for ultimate use by the photofinisher. Such infor- 
mation may include the ambient light level of the primary subject and background, and the like. 
[0043] The advantage of the longitudinal dedicated track format is that magnetic recording of data on the film strip 

20 201 may be performed by the camera using a relatively stationary head (i.e. the head 210) by buffering all of the data 
to be recorded in a particular frame in a particular camera track and then transmitting the data to the head just as the 
film is being wound to the next frame. 

[0044] The microprocessor 215 includes a read only memory 240 containing instructions sufficient to ensure that 
each type of information received is recorded in the correct one of the dedicated camera tracks in accordance with a 

25 universal pre-arrangement common to both the camera and the photofinisher. (The aforementioned APS information 
exchange (IX) specifications illustrate dedicated camera tracks for meta data storage and information exchange. Sec- 
tion 10, "Writing and Reading Magnetic Information", and more specifically Section 10.4 "Data Dictionary", contain the 
relevant information.) For this purpose, the microprocessor sorts and buffers each piece of information in compliance 
with the instructions stored in the read only memory 240. The microprocessor also includes a ROM 250 with other 

30 camera-specific data, such as main flash and preflash guide number and camera to flash separation (if the flash is 
integral with the camera), lens-specific data (e.g., focal length and focus range, if the lens is integral with the camera), 
and the like, as well as a RAM 260 for storing film-specific information read from the film cassette, such as film ISO 
speed read from the DX coding on the film cassette. The meta data in the ROM 250 and the RAM 260 is then mag- 
netically recorded on the film strip 100 for ultimate use by the photofinisher. 

35 [0045] In a typical digital camera embodiment, such as shown in Figure 8, a digital camera 300 includes a lens 340 
that directs image light from a subject (not shown) through an aperture/shutter controller 341 and an anti-aliasing filter 
342 upon an image sensor, which is typically a CCD or CMOS sensor 344. The sensor 344 generates an image signal 
that is processed by an analog video processor 346 before being converted into a digital image signal by an analog 
to digital (A/D) converter 348. The digitized image signal is temporarily stored in a frame memory 350, and then com- 

<o pressed by a digital signal prooessor 352. The compressed image signal is then stored in a data memory 354 or, if a 
memory card 356 is present in a memory card slot of the camera, transferred through a memory card interface 358 to 
the memory card 356. In this embodiment, the memory card is adapted to some appropriate interface standard, such 
as the PCMCIA card interface standard as described in the PC Card Standard, Release 2.0, published by the Personal 
Computer Memory Card International Association, Sunnyvale, Calif., September, 1991 . 

45 [0046] Electrical connection between the memory card 356 and the camera 300 is maintained through a card con- 
nector 359 positioned in the memory card slot. The card interface 358 and the card connector 359 provide, e.g., an 
interface according to the aforementioned PCMCIA card interface standard. The compressed image signal may also 
be sent to a host computer, which is connected to the camera 300 through a host computer interface 360. A camera 
microprocessor 362 receives user inputs 364, such as from a shutter release, and initiates a capture sequence by 

50 triggering a flash unit 366 (if needed) and signaling a timing generator 368. The timing generator 368 is connected 
generally to the elements of the camera 300, as shown in Fig. 8, for controlling the digital conversion, compression, 
and storage of the image signal. The microprocessor 362 also processes a signal from a scene sensor (photodiode) 
370 for determining a proper exposure, and accordingly signals an exposure driver 372 for setting the aperture and 
shutter speed via the aperture/shutter controller 341 . The CCD sensor 344 is then driven from the timing generator 
55 368 via a sensor driver 374 to produce the image signal. 

[0047] The microprocessor 362 may accept meta data to be recorded on the digital record from the camera user 
inputs 364 or from camera mechanism inputs 380, such information pertaining, for example, to the desired display 
parameters, lens parameters (e.g., focal length, F-number, camera lens focus range), shutterspeed, autofocus distance 
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measurements of subject and background, backlight and flash fire state indicators, and the like, for ultimate use by the 
photofmisher. The user inputs 364 can also include the resolution setting and gain factor of the camera (number of 
pixels in the captured image and the sensor-based ISO speed, if such is settable). If a suitable input device is provided, 
for example a keypad or a voice-actuated input, demographic data could be generated at this time by operator input 
of the information, although other techniques may be employed without limitation. The microprocessor 362 may also 
accept scene related Information from the scene sensors 370 to be recorded on the digital record for ultimate use by 
the photofinisher. Such information may include the ambient light level of the primary subject and background, and the 
like. The microprocessor 362 may also accept camera shake data (measure of handheld stability of the camera) from 
a shake sensor 382. 

[0048] Certain camera meta data may be contained In the camera PROM 328, which is connected to the digital 
signal processor 352. The camera PROM 328 includes camera-specific data, such as main flash and preflash guide 
number, camera to flash separation, sensor ISO speed, resolution setting, camera shake factor, and the like. Such 
camera-specific data may be variable (for example, If the flash unit is separable, movable, or otherwise adjustable) or 
invariant (for example, if the flash is non-movable and integral with the camera). Likewise, the sensor ISO speed may 
be the base ISO speed and the resolution setting may be the native setting, if this data is invariant. Different data 
structures may be used to transfer the meta data and the image data from the camera. For example, the digital signal 
processor 352 may write the meta data into a camera header, followed by individual image trailer records. In another 
data structure, the meta data Is written into individual camera headers together with Individual image trailer records. 
Alternatively, certain of the meta data, such as the camera-specific data stored in the PROM 328 may be contained in 
a computer file 330 (instead of, or in addition to being, in the PROM 328), which is provided as a floppy disk or the like 
in combination with the camera 300. This meta data is then accessed by the host computer through a conventional 
disk drive interface (not shown) when the user loads the disk into the interface. The meta data may also be embedded 
with the image data in a form of digital watermarking, e.g., as taught in U.S. Patent No. 6,044,156, entitled "Method 
for Generating an Improved Carrier for Use in an image Data Embedding Application", which is incorporated herein 
by reference. 

[0049] It should be understood from the foregoing description of meta data creation in connection with film and digital 
cameras that other forms of meta data pertaining to camera, scene, demographic and display factors would be known 
to those of skill in this art, and are intended to be within the ambit of this invention. Likewise, other structures and 
mechanisms for the transfer of such meta data to subsequent utilization devices, such as digital photofinishers, would 
be clear to the skilled person and are intended to be within the scope of this invention. In addition, while the capture 
devices are described as film and digital cameras, it is possible that other capture devices such as a linear or area 
scanner could benefit from the invention, and to that extent are also included within the inventive concept. 
[0050] In the case of the film or digital camera, the camera, scene or demographic factors can be directly recorded 
on the output medium, or the microprocessor 21 5 (film) or 362 (digital) may be employed to perform image processing 
upon the factors, for instance to determine if the current image will have a level of defect that if left untreated would 
reduce the perceived quality of the final viewed image. Consequently, in the latter case, the microprocessors 21 5 and 
362 may perform some, or all, of the image processing performed by the digital image processor 40 shown in Figure 
1 , that is, to predict the presence of image defects in a photograph and subsequently to enable image defect correction 
means only when needed. 

[0051] The present invention can be implemented in computer hardware. For example, FIG. 2 represents a functional 
block diagram of a digital photofinishing system where the image acquisition block 50 includes image data and meta 
(e.g. APS IX) data from a capture device 52 such as the conventional photographic film camera 200 shown in Figure 
7 for recording a scene on color negative or reversal film, and a film scanner device for scanning the developed image 
on the film and producing a source digital image 56 and extracting the image meta data. Another example of an image 
capture device 52 is the digital camera 300 shown in Figure 8 that has the ability to produce a source digital image 56 
directly. The formed digital image file and associated meta data is transferred to the image analysis block 60, where 
the meta data decoder 66 extracts the information to be used in the image defect prediction process block 68. The 
primary purpose of the image defection correction process block 68 is to analyze available meta data pertaining to 
camera, scene, demographic, and image display factors, predict the presence of image defects in the final image 
display, and subsequently to activate only those correction processes in image processing block 70 that will improve 
the quality of the current image. 

[0052] Prior to providing a detailed description of image defect prediction process block 68, the other functions of 
the image analysis 60, image processing 70, and image output 90 blocks will be described. 

[0053] In the image analysis block, the full resolution digital image containing the red, green, and blue pixel data is 
subsampled in block 62 to create a smaller, for example 24 x 36 pixel, image that is analyzed by the scene balance 
algorithm 64. Scene balance algorithms use analog or digital processing to obtain the correct color balance and overall 
lightness for each image. The algorithms are commonly known as "white-balance," "color-constancy" or "scene-bal- 
ance" algorithms. These algorithms can work on a single image, several images, or an entire set of images. An example 
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of a suitable scene balance algorithms is described by E. Goll et al., "Modem Exposure Determination for Customizing 
Photofinishing Printer Response", Journal of Applied Photographic Engineering, 2, 93 (1979), which is incorporated 
herein by reference. Further improvements in scene-balance algorithms include setting a degree of illuminant chromatic 
correction using inferential illuminant detection, as disclosed in U.S. Patent Number 6,133,983, which is incorporated 
herein by reference. Additional information on the film can help to characterize the variability of the chemical process 
that was used to develop the film. For example, as taught in U.S. Patent Number 5,649,260, which is incorporated 
herein by reference, at least one or more gray reference patches with known exposure could have been exposed on 
the film during manufacturing, and then used to provide full-order film color balance calibration information to the scene 
balance algorithm. 

[0054] The present invention may be practiced with any scene balance module such as the one described by Cok 
et al. in U.S. Patent Number 4,945,406, which is incorporated herein by reference. The scene balance module calcu- 
lates the pixel values of a theoretical 20% gray card corresponding to the exposure of the scene digital image. A look- 
up-table is calculated and applied to the scene state digital image, which results in a balanced digital image. Although 
no scene balance module performs perfectly at the task of compensating the digital image for variations in exposure 
and illumination color effects, the scene balance module does improve the accuracy of the color representation of the 
digital image. Because the scene balance algorithm is needed for nearty all images, due to imperfections in image 
capture exposure and/or illuminant color balance, and the scene balance algorithm is computationally efficient, due to 
the use of a smaller subsampled image, it Is applied to every image in the preferred embodiment of the present invention. 
[0055] However, in accordance with the invention, and as noted earlier, the remaining image processing steps, which 
are located in the image processing block 70, are selectively applied based on the output of the image defect prediction 
process 68. Forthis purpose, the digital image produced in block 56 is provided in separate chrominance and luminance 
channels, as needed, for the subsequent image processing steps. 

[0056] If activated by the image defect prediction process 68, the noise correction process 72 is applied prior to the 
scene balance shift 76 to make use of the unbalanced color channel specific exposure information of the capture. The 
Sigma filter, described by Jong-Sen Lee in the aforementioned journal article Digitai image Smoothing and the Sigma 
Filter, which is incorporated herein by reference, is a noise reduction algorithm to enhance the visual appearance of 
the processed digital image. The values of the pixels contained in a sampled local region, n by n pixels where n denotes 
the length of pixels in either the row or column direction, are compared with the value of the center pixel, or pixel of 
interest. Each pixel in the sampled local region is given a weighting factor of one or zero based on the absolute difference ?. 
between the value of the pixel of interest and the local region pixel value. If the absolute value of the pixel value 
difference is less or equal to a threshold, the weighting factor is set to one. Otherwise, the weighting factor is set to 
zero. The numerical constant £ is set to two times the expected noise standard deviation . Mathematically the expression 
for the calculation of the noise reduced pixel value is given as: 

<Wn - *ij a y Py / £ fj a {j EQ. 1 

and 

a iT 1 "Pl-Pmn^e 



a U = Oif, P0-Pmn , >£ 

where p y represents the ij 1h pixel contained in the sampled local region, p mn represents the value of the pixel of interest 
located at row m and column n, a^ represents a weighting factor, and q mn represents the noise reduced pixel value. 
Typically, a rectangular sampling region centered about the center pixel is used with the indices i and j varied to sample 
the local pixel values. 

[0057] The signal dependent noise feature is incorporated into the expression for e given by: 

e*Stacc n (p mn ) EQ.2 

where o n represents the noise standard deviation of the source image evaluated at the center pixel value p mn . The 
parameter Sfac is termed a scale factor can be used to vary the degree of noise reduction. The calculation of the noise 
reduced pixel value q mn as the division of the two sums is then calculated. The process is completed for some or all 
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of the pixels contained in the digital image channel and for some or all the digital image channels contained in the 
digital image. The noise reduced pixel values constitute the noise reduced digital image. 

[0058] A median filtermay also be used as a noise reduction algorithm to reduce the noise present in a digital image. 
The noise reduced pixel value produced with a median filter is typically derived by calculating the statistical mean of 

5 values taken from a sampling region centered about the pixel of interest. Typically an n by n square window size is 
chosen where n denotes the length of pixels in either the row or column direction. The degree of noise reduction is 
controlled by the size of the window. Larger window sizes result in more noise removed from the digital image. 
[0059] If activated by the image defect prediction process 68, the redeye correction process 80 locates and removes 
eye color defects. The present invention may be practiced with a variety of methods which locate and correct redeye 

*o defects. One suitable method is disclosed by Benati et al. in U.S. Patent Number 5,748,764, which issued 5 May 1998 
and which is incorporated herein by reference. This method locates redeye defects in an image and provides separate 
corrections for body, border, and glint pixels in the pupil of the affected eye. The redeye defect detection process 
involves defining a spatial region within the digital image in which one or more eye color defects may exist, which 
includes at least a portion of the subject's head; sampling the color content of the pixels within the spatial region and 

15 comparing the sampled pixels with threshold values indicative of redeye defect pixels; segmenting the potentially de- 
fective pixels into contiguous groups; calculating a first score for each pixel of each group based on a plurality of 
features including group size, group shape, coloration, and brightness to identify redeye defect candidates; selecting 
a seed pixel based on its score from each identified eye color defect group candidate and determining all of the neigh- 
boring pixels which are within a predetermined score range of their neighboring pixels and those pixels which represent 

20 a significant pixel score transition indicative of the outer boundary of the redeye defect. Each of the three redeye defect 
pixel types (body, border, glint) is rendered differently to remove the defect and create a natural appearing correction. 
[0060] An alternative embodiment of the present invention employs a method of applying a redeye defect location 
and correction process disclosed by Schildkraut et al. in commonly-assigned U.S. Patent Number 6,292,574 issued 
September 18, 2001, entitled A Computer Program Product for Redeye Detection. This process is advantageously 

25 used in conjunction with the method disclosed in commonly-assigned U.S. Patent Number 6,1 51 ,403 issued November 
21, 2000 entitled Method for Automatic Detection of Human Eyes in Digital Images by Jiebo Luo. Both patents are 
incorporated herein by reference. 

[0061] Many natural scenes photographed under ambient lighting conditions result in photographic images which 
have a luminance dynamic range that far exceeds the dynamic range of conventional display systems. For example, 

30 photographic images taken in sunny outdoor conditions can have 1 0 or more photographic stops of recorded information 
while photographic paper can reproduce approximately seven photographic stops of information. In addition, as noted 
above, electronic flash illumination, by virtue of the distance-induced exposure difference between the main subject 
and background, can also produce dynamic range that exceeds the capacity of the chosen display. In digital imaging 
systems scene dependent tone scale function algorithms may be employed to reduce the dynamic range of the source 

35 digital image thus providing a better match of the processed digital image to the dynamic range capabilities of the 
output medium. 

[0062] If a high dynamic range scene is anticipated by the image defect prediction process 68, the scene dependent 
tonescale correction process 84 is employed. The tone scale correction process 84 uses the pixels in the balanced 
digital image to calculate a tone scale function, i.e., a single valued mathematical equation or transformation that has 
40 a single output value corresponding to each input value. The present invention implements the tone scale function as 
a look-up-table for computation efficiency. The result of the application of the tone scale processing produces a tone 
scale adjusted digital image such that the tone scale, or brightness and contrast, of the digital image is enhanced 
without modification of the color content. 

[0063] The present invention may be practiced with a variety of methods that generate tone scale functions. The 
45 preferred embodiment of the present invention uses the methods disclosed in US Patent Nos. 4,731 ,671 and 5,822,453, 
which are both incorporated herein by reference. These methods are employed by the present invention to produce 
two individual tone scale functions. These two tone scale functions are then cascaded into single tone scale function 
which is used to adjust the brightness and contrast of the balanced digital image. 

[0064] In U.S. Patent Number 5,822,453, Lee and Kwon disclose a method of calculating a tone scale function using 
so the pixel values of a digital image, and involving the estimation of the scene contrast from the digital image. The method 
taught by Lee and Kwon involves calculating a Laplacian filtered version of the digital image; forming a histogram of 
the Laplacian signal; determining from the Laplacian histogram two threshold values which when applied to the Lapla- 
cian signal substantially eliminate uniform areas; sampling pixels from the digital image which are based on the thresh- 
olds; forming a histogram from the sampled pixels; computing a standard deviation of the sampled histogram; and 
55 estimating contrast of the digital image by comparing the computed standard deviation with a predetermined contrast 
for determining contrast of the input image in relationship with the predetermined contrast. The method described by 
Lee and Kwon is used to calculate a first tone scale function. 

[0065] In U.S. Patent Number 4,731 ,671 , Alkofer discloses a method of calculating a tone scale function using the 
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pixel values of a digital image based on normalizing the histogram of a digital image. This method involves determining 
the contrast of the digital image by calculating the standard deviation of a sample of pixel values.. The second tone 
scale function is calculated by normalizing a histogram of the sample of pixel values. The sample of pixel values is 
selected from one of a plurality of samples of pixel values corresponding to a plurality of contrast intervals based upon 
the shape of the histogram of the selected sample of pixel values. To facilitate the adjustment of contrast, the tone 
scale function is constructed to produce values in units of a standard normal variate Z. These 2 values are then mul- 
tiplied by a constant, which is a function of the standard deviation of the sample of pixel values to determine the contrast 
of the processed digital image. 

[0066] The first and second tone scale functions are combined into a final tone scale function using the mathematical 
formula: 

LUT f = LUT^LUTgffl] E Q. 3 

where LUT 2 represents the second tone scale function, LU^ represents the first tone scale function, and LUT f repre- 
sents the final tone scale function. The j variable representthe index of pixel values of the digital image to be processed. 
The final tone scale function LUT f is calculated by evaluating the expression of equation 3 for the range of possible 
pixel values. 

[0067] The final tone scale function LUT f and the balanced digital image is received by the tone scale correction 
block 84. The present invention applies the final tone scale function to the luminance digital image channel of the 
balanced digital image to adjust the brightness and contrast attributes of the digital image. The preferred embodiment 
of the present invention applies the final tone scale function, in the form of a look-up-table, directly to the pixels of the 
luminance digital image channel of the balanced digital image. This method-is preferred primarily for its computational 
efficiency properties. 

[0068] An alternative embodiment of the present invention employs a method of applying a tone scale function dis- 
closed by Lee et al. in U.S. Patent Number 5,012,333, which is incorporated herein by reference, for improved image 
quality results. Although Lee et al. describe a method for interactively modifying image attributes, the present invention 
employs the method of applying tone scale functions to digital images based on spatial filtering techniques. This method 
involves spatially filtering the luminance digital image channel resulting in two spatial frequency components (high and 
low frequency components), applying the tone scale function to the low spatial frequency component, and combining 
the tone scale modified low spatial frequency component with the high spatial frequency component. This approach, 
employing frequency separable tone scale manipulation and sharpening, is superior to methods such as those dis- 
closed in U.S. Patent Number 5,739,924, which involve emphasizing the outline and the contrast of the subject based 
on subject brightness, and subject distance. 

[0069] In the preferred embodiment of the present invention, if a scene with sharpness problems is anticipated by 
the image defect prediction process 68, the sharpness correction process 88 is employed. The sharpness correction 
block 88 receives the tone scale adjusted digital image from the tone scale module 84 and applies a spatial filter to 
the tone scale adjusted digital image to adjust spatial modulation content. The present invention may be practiced with 
a variety of different spatial filters; however, a key aspect of the present invention relies on the combination of the 
method of manipulation of the color, tone and spatial detail attributes of a digital image. An example of a spatial filter 
that may be used is described by Kwon et al. in U.S. Patent Number 5,398,077, which is incorporated herein by ref- 
erence. Kwon et al teach a method of spatially processing a digital image involving transforming a red-green-blue 
image into a luminance chrominance domain and applying an adaptive filter to the luminance channel. The adaptive 
filter employs a method of calculating a statistical measure of local spatial activity and varying the sharpness of the 
image detail structure based on the statistical measure. The result of the application of the spatial filter produces a 
tone scale adjusted digital image with modified values such that the spatial detail of the digital image is enhanced 
without modification of the color content. 

[0070] The image output block 90 receives the modified digital image from the sharpness correction block 88. The 
digital image processing steps conducted within the output device rendering block 92 involve transforming the pixel 
values of the modified digital image into a corresponding set of device code values to account for the color manipulation 
characteristics of the output device and media. The transformation between device code values and the colorimetry 
of the colors reproduced by a particular device/media combination can be obtained by a device characterization. An 
example of a device characterization is a procedure that involves generating and printing or displaying a suitable array 
of device code values in the form of color patches of a size large enough for subsequent measurement. These patches 
can be measured using a colorimeter, a spectrophotometer or a telespectro radiometer, depending on the nature of the 
display. If spectra are measured, CIE XYZ values and other related quantities such as CIELAB or CIELUV values can 
be calculated for the display illuminant using standard colorimetric procedures. This data set can be used to construct 
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the appropriate sequence of one-dimensional look-up tables, multidimensional look-up tables, matrices, polynomials 
and scalars that accomplishes that transformation of the digital representation ofthe scene resulting from the combined 
processing operations performed in the output device rendering block 92 into a set of device code values that produces 
this desired visual representation of the scene. Another example ofthe implementation of this transformation is an ICC 
profile that maps the specifications of the desired visual reproduction, encoded in profile connection space (PCS), to 
device code values. 

[0071 ] This operation may also include gamut mapping. The color gamut characteristics of the modified digital image 
are determined by the set of primaries that was used for encoding the data. Examples include the primaries corre- 
sponding to the color-matching functions of the CIE 1931 Standard Colorimetric Observer or any linear combinations 
thereof. Gamut mapping is performed between the gamut defined by this encoding and the gamut of the output device/ 
media combination. The preferred gamut mapping algorithms used in combination with this invention are those that 
maintain hue. 

[0072] From an imaging processing point of view, the data transformation performed by the output device rendering 
block 92, whether dedicated to neutral balance or color gamut functions can be combined to form a single set of one- 
dimensional look-up tables, multidimensional look-up tables, matrices, polynomials and scalars in any sequence. Re- 
productions according to the specifications of this invention can be produced by a variety of technologies. Reproduc- 
tions can be obtained on silver halide or other light-sensitive materials. 

[0073] The light-sensitive material, as used by an image output device 96, may be transparent film, reflective paper, 
or semi-transparent film. These materials are exposed by visible or infrared light derived from many different sources! 
The materials may be designed for typical photofinishing applications or they may be specially designed for digital 
printing applications. The photo-sensitive materials respond primarily to three different spectral regions of incident light. 
Typically, these are red (600-720 nm), green (500-600 nm), and blue (400-500 nm) light. However, any combination 
of three different spectral sensitivities can be used. These could include green, red, and infrared light or red, infrared 
1 , and infrared 2 light, or 3 infrared lights of different wavelengths. Or a material sensitive to the three primary wave- 
lengths of visible light may be false sensitized so that the color of the exposing light does not produce image dye of 
the complementary hue, such as red, green, and blue sensitivity producing magenta, yellow, and cyan dye, respectively. 
Printing can be effected by exposing all pixels sequentially, by exposing a small array of pixels at the same time, or by 
exposing all the pixels in the image at the same time. 

[0074] Devices which can be used to print on light-sensitive materials include CRT, LED (Light Emitting Diode), LVT 
(Light Valve Technology), LCD, Laser, as well as any other controlled optical light generating device. All these devices 
have the ability to expose 3 or more light-sensitive layers in a light-sensitive material to produce a colored image. They 
differ mainly in the technology on which the devices are based. A suitable embodiment of a CRT printer is the Kodak 
Digital Science LF CRT Color Printer which can be used in combination with Kodak Professional Digital III Color Paper. 
[0075] Non-light-sensitive imaging materials are conveniently used by electronic printing processes to produce high- 
quality reproductions. The printing process can be based on many technologies. The method of image formation can 
be half-tone, continuous tone, or complete material transfer. The imaging material can be transparent film, reflective 
paper, or semi-transparent film. The materials can be written on to produce pictorial images by thermal dye transfer, 
ink jet, wax, electrophotographic, or other pixelwise writing techniques. These processes use three or more colorants 
to create colored pictorial representations of pictorial scenes. The colorants may be dyes, toner, inks, or any other 
permanent or semi-permanent colored material. A suitable embodiment of a thermal printer is the Kodak XLS 8650 
thermal dye transfer printer. 

[0076] In addition to hardcopy viewed images, it is also possible with the current invention to efficiently create pro- 
jected images. Many technologies are appropriate for this kind of image generation. All these techniques rely on pro- 
ducing color images with two or more colored lights. These are typically red, green, and blue in nature although they 
can be any set of primaries. Devices which can be used to create the preferred viewed reproduction include CRT, LCD, 
EL (Electro-Luminescence), LED, OLED (organic LEDs), light bulbs, lasers, plasma display panels, or any other three 
or more colored lighting apparatus capable of pixelwise illumination. The images can be created by display within the 
device, projection, or backlighting. Many devices create an image on a screen or display area which is physically a 
part of the mechanical unit. However, images can also be created by optically projecting the image in the form of light 
rays from behind or in front of the viewer toward a screen which is in front of a viewer or by projecting a reversed image 
toward the viewer onto a screen between the viewer and the projecting device. A suitable embodiment of a CRT display 
is a Sony Trinitron CRT. 

[0077] This concludes the detailed description of the functions of the image analysis 60, image processing 70, and 
image output 90 blocks. A detailed description of the image defect prediction process block 68, that controls the state 
(active or inactive) of each of the previously detailed image defect correction processes in the image processing block 
70, will now be provided. 

[0078] After the meta data associated with the current image is decoded in block 66, a number of intermediate 
parameters are calculated in the image defect prediction block 68 from the decoded data and subsequently used in 
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the Noise Defect Prediction Block 100 shown in FIG. 3, the Redeye Defect Prediction Block 120 shown in FIG. 4, the 
Tonescale Defect Prediction Block 140 shown in FIG. 5, and the Sharpness Defect Prediction Block 160 shown in FIG. 
6. Other parameters such as Flash Fire State (on/off) can be used directly, and still other parameters may require units 
conversion; for example, ambient light level, which is converted from camera BV (brightness value) to luminance units 
in foot lamberts. The intermediate parameter values, calculated from decoded meta data, and used in multiple defect 
prediction blocks, are calculated once and shared among the defect prediction blocks 100,120,140,160. However, for 
purposes of illustrating the functionality of each of the defect prediction blocks, the processing steps (104,124,144, 
164) that follow the meta data input block show the creation of each of the intermediate parameters. 
[0079] This section defines the meta data items and shows the manner in which the intermediate parameters are 
calculated from the meta data items in the preferred embodiment. Those skilled in the art will understand that the units 
of distance, exposure, flash output, system magnification, and image defect level can be recast without departing from 
ihc intent of the teachings of the present invention. In the preferred embodiment the following meta data items, or a 
subset thereof, collected by camera sensors and/or manual photographer input are employed in the image defect 
prediction blocks (100,120,140,160): 

General Parameters: 

SaDjec: Dcnoq'Hpnc Data includes: subject race and age 
U»e*- Specified or Camera-Measured Parameters: 
IOOSO] 

•t . - • • i K*c*iight indicator State (on/off;on = high contrast scene) 

CSf : r- « • i >-Mke Factor (measure of handheld stability of camera) 

D ; 2 ^ :>*^onsion of the Final Display (inches) 

OG : > : P actor (multiple of sensor base ISO speed) 

OR *wm 4 j?ion Setting (number of pixels in captured image) 

D* ;*.^rt o Subject Distance (feet) 

Db Z*ru- to Background Distance (feet) 

I Ch~>:t« .ens F-number 

Ft Crf-ncr* Lens Focal Length (inches) 

FF ► us* Fire State (on/off) 

FLS RrtsMo-Camera Lens Separation (center-to-center, inches) 

GNm Mrt n Flash Output (Guide Number, Current ISO, feet) 

GNp. Prennsh Output (Guide Number, Current ISO, feet) 

K ANSI Lens Exposure Constant (default =3.91) 

LFR Carrera Lens Focus Range (inside or outside of range) 

LS Linear Smear (in mm at capture plane during exposure) 

LLs Ambient Light Level of Primary Subject (camera light meter reading in foot lamberts) 

LLb Ambient Light Level of Background (camera light meter reading in foot lamberts) 

Mc Current Reproduction (e.g. printing) Magnification 

Ms: Standard Reproduction (e.g. printing) Magnification 

Mn Non-Standard (e.g. enlargement) Magnification 

S: Film or DSC Sensor ISO Speed 

T: Camera Snut-er Speed (seconds) 

Intermediate Parameters: 

[0081] In the preferred. embodiment, a number of intermediate parameters are calculated from the meta data listed 
above and employed in the image defect prediction blocks (100,120,140,160). 

[0082] The intermediate parameters quantifying the degree of exposure of the main subject and the background, 
which have been found to be useful in the noise defect prediction 100 and tonescale defect prediction 140 blocks, are 
calculated as follows: 
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[0083] 
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Esf for flash illumination = log 10 [(GNm/Ds)/f] 2 EQ. 4 

Esa for ambient illumination = log 10 [LLs/LLn] Eq 5 

[0084] Where LLn (EQ.6) is defined as the light level (foot lamberts) where an ISO normal exposure occurs with the 
current camera settings, and is found with the following equation, set forth in the ISO/ANS I standard for qeneral-pumose 
photographic exposure meters ANSI 3.49-1 987. K 

LLn = (K)(f 2 )/(S)(T) EQ 6 

Flash Illumination Example 
[0085] 

Let camera main flash GN = 48 for ISO 200 film 
Ds (subject distance) = 6 feet 
f (lens f-number)= 5.6 

Esf= log 10 [(AB/eys.S] 2 = 0.30 log E 

(one-stop over exposed) 

Ambient (natural) Illumination Example 

[0086] 

Let LLs (light level of subject) = 4 foot lamberts 
K = 3.91 

f (lens f-number) = 4 

S (ISO film speed) = 400 

T (shutter time, seconds) =0.01 

LLn = [(3.91)(4 2 )]/[(400)(0.01)] = 16 foot lamberts 
Esa = Iog10 (4/16) = -0.6 log E 



(two-stops under exposed) 

[0087] The flash and ambient exposures for the background are calculated in the same fashion, but in this case the 
so measured camera-to-background distance data and background illuminance levels are used in the equations. 

_Eb: Exposure on Background(loq i n E units) 

[0088] 



55 



Ebf for flash illumination = log 10 [(GNm/Db)/f] 



EQ.7 
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Eba for ambient illumination = log 10 [LLb/LLn] EQ. 8 

[0089] Where LLn (EQ.6) is defined as the light level (foot lamberts), in this case for the background portion of the 
scene, where an ISO normal exposure occurs with the current camera settings. 

[0090] The intermediate parameters, hereafter identified as FER (Rash Exposure Ratio) and AER (Ambient Expo- 
sure Ratio), which quantify the ratio between the exposure on the main subject and the background, and have been 
found to be useful in the Tonescale Defect Prediction Block 140, are calculated as follows: 

FER: Flash Exposure Ratio 

FER = 10 Es, - Eb " EC. 9 

AER: Ambient Exposure Ratio 

a rro 4 n IEsa-Ebal , 

AER =10 EQ. 10 

[0091] The absolute value signs are needed in the exponent term of EQ.9 and EQ.10 to reflect the fact that the 
magnitude of the exposure ratio, whether the subject or background is receiving the dominant exposure, is the key 
parameter. 

[0092] The intermediate parameters quantifying the format and subject reproduction magnifications as perceived in 
the final display, which have been found to be useful in the Noise Defect Prediction Block 100, the Redeye Defect 
Prediction Block 120, and the Sharpness Defect Prediction Block 160, are calculated as follows: 

AM: Angular Magnification of Subject 

[0093] In the Sharpness Defect Prediction Block 160, at decision point 172, the AM parameter has been found to be 
a useful predictor for determining the maximum handheld shutter time required to minimize hand-tremor-induced blur 
for the average photographer. And in the Redeye Defect Prediction Block120,at decision point 130, the AM is a useful 
predictor in that it correlates with the size of the subject's eyes, and therefore contributes to the perceived severity of 
the redeye image defect. 

AM = [(FL)(Mc)]/Vd EQ.11 



Where: 

Vd: Final Image Viewing Distance (specified in inches) 

[0094] As disclosed in columns 43-44 of U.S. Patent Number 5,323,204, which issued 21 June 1994 to the present 
assignee and which is incorporated herein by reference, and contrary to conventional wisdom, human observers do 
not view photographs at distances that are linearly related to the display size. In this regard, empirical-based perceptual 
measurements of eye-to-display separation suggest that the average viewing distance (Vd) for handheld prints of 
varying sizes can be characterized as follows: 

Vd = 3.64+1 1.34[log 10 (D)] EQ. 12 

Where: 

D = the diagonal dimension of the final display (specified in inches) 

Viewing Distance Examples 

[0095] 

Vd for 4 by 6 inch print = 3.64+1 1 .34[log 10 (7.2)] = 13.4 inches 
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Vd for8 by 12 inch print = 3.64+1 1.34[log 10 (1 4.4)] = 16.8 inches 



MST: Maximum Handheld Shutter Time (seconds) 

[0096] The maximum handheld shutter time is an intermediate parameter that specifies for the current camera lens 
focal length, reproduction magnification, and final image viewing distance, the longest handheld shutter time that can 
be employed by a person with average hand tremor with out causing noticeable blur to be perceived in the final image. 
The MST is affected by the camera shake factor (CSF), which specifies the stability of the camera. The CSF is one of 
the items that may be included in the general camera meta data. If the CSF is not available, the default value of unity, 
appropriate for typical 35mm point-and-shoot cameras is used. We have found that interchangeable lens (SLR) cam- 
eras are more stable and typically have CSF values of about 1 .4. 

MST = [(0.169)(Vd)(CSF)] / [(FL)(25.4)(Mc)) EQ. 13 

Where tnc constant 0.1 69 is needed to fit the experimental data such that the result of the MST equation Is the longest 
ciposu'- imc iha: will not produce significant image blur. The value of this constant was derived by rating the quality 
c' i>*».j: l> v * representative population of photographers, where the exposure times were systematically varied. 

T r»t c ;•<_ 4 converts the camera lens focal length from inches to mm. 

y -»* rn '~ M * - ^ - Shutter Time Examples 

[0097] - ft- SLR camera, 35mm lens, 4X6 inch print: 

FL • 3r .*v*. 
Uc 4^ 
Vd > .rv^ 
CSF t 4 



MST = [(0.1 69)(13.36)(1 .4)] / [(1 .3B)(25.4)(4.44)] = 0.02 second 

[0098] APS point-and-shoot camera, 25mm lens, 4X10 inch panoramic print: 

FL 0 98 inch 
Mc 10 6 
Vd 15 35 inch 
CSF • 1 C 



MST = [(0.1 69)(15.35)(1.0)]/[(0.98)(25.4)(1 0.6)] = 0.01 second 

[0099] Those examples show that a shorter exposure time Is needed to maintain an acceptable level of hand-tremor- 
induced blur with higher reproduction magnification and less stable (lower CSF) cameras. 

DSF: Display Size Frictor 

[0100] The present authors disclose the Display Size Factor (DSF) in columns 44-45 of the aforementioned U.S. 
Patent Number 5,323,204, which quantitatively accommodates the independent selection of display size and repro- 
duction magnification. The DSF was conceived in orderto account for the differences between full-frame reproductions 
and cropped image reproductions where the final display size is not the product of the capture media format size and 
the reproduction magnification. This occurs when pseudo-panoramic or pseudo-telephoto (electronic zoom) features, 
which are popular on APS and DSC cameras, are selected by the photographer and/or when a device such as a Kodak 
Picture Maker is used to selectively zoom and crop a portion of a full-frame image to create a new composition. DSF 
has been found lo be advantageous when predicting the severity of defects that vary in response to the reproduction 
and viewing magnifications, ratherthan the subject reproduction magnification, as was the case with the AM parameter. 
For example, the DSF parameter is used in the Noise Defect Prediction Block 1 00 to create lookup tables for decision 
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point 106, and in the Sharpness Defect Prediction Block 160 at decision point 166. 

[0101] These are cases where the perceived quality loss due to the image defect is correlated with the degree to 
which the structure of the image is magnified. 

DSF = (Ms/Mn)(Vdn/Vds) EQ. 14 

Where: 

10 Vds is the viewing distance for the standard display size 

Vdn is the viewing distance for the non-standard display size 

Display Size Factor Examples 

15 [0102] 35-mm format camera, 4X6 inch full-frame print: 

Ms = 4.44 
Mn = 4.44 
Vds = 13.4 inches 
20 Vdn = 13.4 inches 

DSF = (4.44)/(4.44)(1 3.4/13.4) = 1.0 

*5 [01 03] 35-mm format camera, 8X1 2 inch full-frame print: 

Ms = 4.44 
Mn =8.88 
Vds = 13.4 inches 
30 Vdn = 16.8 inches 

DSF = (4.44/8.88)(1 6.8/1 3.4) = 0.63 

35 [0104] 35-mm format camera, 2X electronic zoom (EZ) 4x6 inch print: 

Ms = 4.44 
Mn = 8.88 
Vds = 13.4 inches 
40 Vdn = 13.4 inches 

DSF = (4.44/8.88)(1 3.4/1 3.4) = 0.5 

45 [01 05] These examples show that the DSF decreases when the reproduction magnification increases and the viewing 
distance decreases. For example, the full-frame 8X12 inch case and the 2X EZ 4X6 inch print case have the same 
reproduction magnification (Mn), but the 2X EZ case produces a smaller print and therefore a closer viewing distance 
that leads to a smaller DSF, which correlates with lower perceived image quality. 

[01 06] As outlined in the Background section, each of the image defects included in the inventive process was studied 
so to determine the relationship between the amount of defect present and the perceived image quality loss. The image 
quality loss data was correlated with scene, camera, demographic, and display parameters to develop the predictors 
employed in Image Defect Prediction Blocks 100,120,140 and 160, which were found to be useful in determining when 
to activate the digital image defect correction processes contained in Blocks 72,80,84 and 88. The empirical studies 
leading to the development of the predictors involved the generation of images of a variety of scene types containing 
55 camera, illumination, display, and subject characteristics spanning the range encountered in consumer photography. 
By varying the imaging system and scene characteristics in this fashion, a population of images containing a wide 
range of image defects was produced. These images were visually rated for quality using the perceptually uniform 
JND scale previously described, and their associated capture and display characteristics were analyzed to develop 
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the final parameters suitable tor predicting the probability and severity of image defecis. 
SP: Switch Point (on/off) Values for Decision Points 

[0107] From this perceptually-derived image quality data, the predictor values, hereafter denoted as switch-point 
(SP) values, are selected for each decision point parameter 106,128,130,132,148,152,166,170,172,174 that corre- 
spond with 3 JNDs of image quality loss. As discussed at the beginning of this section, the 3 JND value, which corre- 
sponds with about one-half of a subjective image quality category, was selected forthe preferred embodiment because 
it provides a good balance between image processing efficiency (number of images processed per unit time) and image 
quality improvement. The SP values in the table should cover the range of conditions encountered in consumer pho- 
tography. If a general or intermediate parameter falling outside of the current SP values in the decision tables is en- 
countered, the table value closest to the current parameter is applied in the decision process. If a general or intermediate 
parameter falling between the current SP values in the decision tables is encountered, intermediate values may be 
obtained by well-known interpolation techniques. 

[0108] A detailed description of the processes contained in Image Defect Prediction Blocks 100,120,140 and 160 
will now be provided. To illustrate the functionality of the Image Defect Prediction Blocks in the preferred embodiments 
of the present invention, SP parameter values for traditional film (e.g. AgX-based media) will be shown. However, It 
should be appreciated that the inventive image defect prediction processes are equally applicable to other capture 
media, whether based on chemical or electronic capture technologies. In the description that follows, the points in the 
process where specific parameter values vary depending on the image capture modality will be highlighted. 

FIG. 3: Noise Defect Prediction Block 100 

Block 102: Meta Data Input List 

[01 09] Camera Model Data 
K,Ms,Mn,D,LLs,f,T,GNm 

Block 104: Processes 

[0110] 

Calculate Vd EQ.12 
Calculate DSFEQ.14 
Calculate Esf EQ.4 
Calculate Esa EQ.5 

[01 1 1] Select a switch point from the SP Table appropriate for current display size factor (DSF) and Film Speed (S). 
Block 1 06: Decision Point 

[0112] Compare the calculated Es and SP Values: 

If calculated Es >= SP, omit noise correction process 108; 
If calculated Es < SP, execute noise correction process 110; 

[0113] In cases where the noise level of the current capture media and DSF combination does not require noise 
correction at any exposure (Es) level, an SP value substantially below any obtainable by the system (e.g. -100) is 
entered in the table. In cases where the noise level of the current capture media and DSF combination requires noise 
correction at every exposure (Es) level, an SP value substantially above any obtainable by the system is entered in 
the table. 



Table 1 : 



SP (Es) Values for Block 106 (3 JND Level) 


Speed S -> 


ISO 100 


ISO 200 


ISO 400 


Es @ DSF= 1.00 


-100 


-0.52 


-0.30 
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Table 1: (continued) 



SP (Es) Values for Block 106 (3 JND Level) 


Speed S -> 


ISO 100 


ISO 200 


ISO 400 


Es @ DSF = 0.85 


-048 


-0.13 


0 


Es @ DSF = 0.63 


+0.25 


+1.08 


+100 



[0114] Table 1 shows the Es (log 10 Exposure units) SP values for three display sizes and three film speeds. In this 
example, the DSF values are appropriate for a full-frame 35-mm format consumer grade color negative film frame 
enlarged to make a 4x6 inch print (DSF = 1 .0), a full-frame 35-mm format consumer grade color negative film frame 
enlarged to make a 5x7 inch print (DSF = 0.85), and a full-frame 35-mm format consumer grade color negative film 
frame enlarged to make a 8x12 inch print (DSF= 0.63). Since the noise versus exposure relationship may vary de- 
pending on the film technologies selected by the manufacturer, it may be necessary to employ different SP values for 
other film types. This is easily verified by shooting an exposure series on the new film and assessing the noise level 
in the final images. In the preferred embodiment, the SP Table includes all of the DSF possibilities for the particular 
image processing apparatus. For example, some digital printers may only produce prints with fixed magnifications, as 
shown in the current example, while others may offer a wide range of print sizes and intermediate zoom ratios. In these 
more versatile printers, the Noise SP Table preferably contains DSF entries corresponding to each print size and zoom 
magnification. Alternatively, intermediate SP values can be obtained by interpolating between values in the table. 
[0115] The Noise SP Table for digital still cameras (DSCs) also varies with respect to DSF; however, rather than 
loading in films with different sensitivities, as shown above, the photographer or camera exposure control selects a 
Digital Gain Factor (DG) appropriate for the current scene. Therefore, the DSC Noise SP Table lists DSF versus DG. 
The 3 JND quality loss values populating the DSC Noise SP Table can be derived with the empirical photographic 
testing and perceptual evaluations referenced above, alternatively, the values can be generated using the methods 
referenced by the present inventors in the aforementioned articles entitled "Characterization and Prediction of image 
Quality 1 by B.W. Keelan, and "Use of System Image Quality Models to Improve Product Design "by R. B. Wheeler. 

FIG. 4: Redeye Defect Prediction Block 120 

Block 122: Mete Data Input List 

[0116] Demographic Data (user-specified or region-specific) 

Camera Model Data 

FF,FLS,LLs,FL,Mc,D,GNp,Ds 

[01 1 7] The severity of redeye is a strong function of demographic classification of the subject photographed, due to 
two effects. First, more highly pigmented races tend to have more melanin in their pupils, which attenuates the light 
propagating through the pupil via absorption, and reduces the amount of red light exiting the eye. Second, as people 
age, the maximum diameter to which their pupil will dilate at low light levels decreases. Consequently, young people 
exhibit larger pupil sizes than older people under the conditions in which flash photographs are typically taken. The 
dramatic nature of the dependence of redeye severity on demographics is demonstrated in Table 2, which shows the 
frequency of occurrence of noticeable redeye in over one hundred subjects belonging to different demographic groups 
and photographed under rigidly controlled conditions (0.016 foot-candles ambient light; 2.6 inch flash-to-lens separa- 
tion; subject distance 6 feet; no preflash; focal length 80 mm; normally viewed 4x6 inch print from 35-mm format film). 



Table 2: 



Effect of Demographics on Frequency of Redeye 


Demographic Group 


Caucasian Youth 


Caucasian Adult 


Hispanic 


Asian 


African-American 


Frequency of Occurrence 


82% 


70% 


41% 


15% 


9% 



[0118] Both the pigmentation and age affects are evident in this data, although the age effect is more obvious at 
larger flash-to-lens separation (e.g. at 3.8 inches, with other parameters the same), where Caucasian adults exhibit 
redeye only 20% of the time, but Caucasian youth show redeye in 67% of photographs. 

[0119] The SP Tables employed in Redeye Defect Prediction Block 120 at points 128, 130 and 132 may contain 
different values depending on the demographic characteristics supplied. If no demographic data is available, the SP 
Tables for Caucasian Adult, which are shown hereafter in the preferred embodiment, are used as the default. 
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[0120] The flash-to-lens separation (FLS) has a significant impact on the level of redeye defect. However, FLS is a 
fixed value in most cameras (flash is stationary); therefore, the SP Tables employed in Redeye Defect Prediction Block 
1 20 at points 1 28, 1 30 and 1 32 would typically contain S P values derived from the quality evaluation of images produced 
with the FLS of the current camera. To accommodate cameras capable of changing the FLS in response to the pho- 
tographer's lens focal length (FL) selection, for example, as disclosed in U.S. Patent Number 5,331 ,362, which issued 
1 9 July 1 994 and is incorporated herein by reference, the current invention employs a separate SP Table for each FL 
(zoom setting) selected by the photographer. 

[0121] Table 3 shows the affect of FLS on image quality (JNDs) for the following system: Caucasian adult population; 
0.5 foot lamberts ambient light; subject distance 10 feet; no preflash; unit angular magnification; normally viewed 4x6 
inch print from 35-mm format film. 



Table 3: 



Effect of FLS on Redeye Defect Level 


FLS (inches) 


1.0 


1.5 


2.5 


3.5 


Quality Loss (JND Units) 


-4.9 


-3.25 


-1.4 


-0.4 



[0122] The SP Tables employed in Redeye Defect Prediction Block 120 at points 128, 130 and 132 are loaded with 
values that correspond to the specific FLS value supplied in the meta data. 

[0123] Empirical studies show that the quality loss due to the redeye image defect is greatest at an intermediate 
camera-to-subject distance (Ds), which falls within the range commonly occurring in consumer photography. This is 
due to the competition of two opposing effects: (1 ) At longer camera-to-subject distance, the angular separation of the 
flash and lens is reduced, leading to greater red saturation of the pupil, and; (2) At longer distances, the pupil size in 
the final image is diminished, reducing redeye severity. To a crude approximation, as distance increases, the redeye 
becomes more intensely red but the pupil size in the image decreases. The former effect dominates at short distances 
and the latter at longer distances, leading to an extremum in the relationship at intermediate distances. As a result of 
this finding, the SP Tables employed in Redeye Defect Prediction Block 120 at points 128, 130 and 132 in the preferred 
embodiment are populated with values corresponding to the critical camera-to-subject distance, which is defined as 
the distance producing the most severe defect level. This approach ensures that Redeye Correction Process 80 is 
applied when the probability of redeye is high but Ds meta data is not available. If Ds meta data is available, and added 
predictive accuracy is desired, SP Tables for each distance can be advantageously applied. 

Block 1 24: Processes 

[0124] 

Calculate Vd EQ.12 
Calculate AM EQ.11 

[0125] Select switch points from SP Tables appropriate for current demographic group and flash-to-lens separation 
(FLS). 

Block 126: Decision Point 

[0126] Determine the flash fire state (FF): 

If FF is no (off); omit redeye correction process 134; 
If FF is yes (on); proceed to Block 128; 

Block 128: Decision Point 

[0127] Compare the actual LL and SP Values: 

If actual LL >= SP, omit redeye correction process 134; 
If actual LL < SP, proceed to Block 130; 
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Table 4: 



SP (LL) Values for Block 128 (3 JND Level) 


FLS (inches) 


1.0 


1.5 


2.5 


3.5 


LL (foot lamberts) 


2.50 


1.04 


0.48 


0.19 



[0128] Table 4 shows SP value (LL in foot Lamberts) for the critical distance for each FLS with AM at 1 .5, which is 
a demanding case of the sort found on typical 3X zoom cameras. If FLS is not supplied in meta data, a value of one 
inch (smallest separation found on consumer cameras) is assumed. 

Block 130: Decision Point 

[01 29] Compare the calculated AM and SP Values: 

If calculated AM =< SP, omit redeye correction process 134; 
If calculated AM > SP, proceed to Block 132; 



Table 5: 



SP (AM) Values for Block 130 (3 JND Level) 


LL (foot lamberts) 


2.5 


1.0 


0.5 


0.2 


AM 


1.5 


0.82 


0.62 


0.43 



[0130] Table 5 shows SP value for FLS of 1 .0 inches and the critical distance for each AM. If LL is not supplied in 
meta data, a value of 0.2 foot lamberts is assumed. 

Block 132: Decision Point 

[0131] Compare the actual GNp and SP Values: 

If actual GNp >= SP, omit redeye correction process 134; 

If actual GNp < SP, execute noise correction process 136->80; 



Table 6: 



40 



SP (GNp) Values for Block 132 (3 JND Level) 


AM 


1.5 


1.2 


0.9 


0.43 


GNp 


160 


70 


15 


0 



[0132] Table 6 shows SP value for FLS of 1 .0 inches, at the minimum light level for typical flash pictures (0.2 foot 
45 lamberts), and the critical distance for each AM. When the AM is above 1 .2, the preflash guide number (GNp) values 
needed to produce less than 3 JNDs of quality loss indicate higher output than is typically found on consumer cameras; 
whereas, when the AM is below 0.43, the quality loss due to the redeye defect will be less than 3 JNDs with no preflash 
applied. 

so FIG. 5: Tonescale Defect Prediction Block 140 

Block 142: Meta Data Input List 

[0133] Camera Mode! Data 
55 FF,GNm,S,f,T f K,LLs,LLb,Ds,Db 
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Block 144: Processes 



[0134] 

s Calculate Esa EQ.5 

Calculate Eba EQ.8 

Calculate Esf EQ.4 

Calculate Ebf EQ.7 

Calculate AER EQ.10 
10 Calculate FER EQ.9 

Block 146: Decision Point 

[0135] Determine the flash fire state (FF): 

15 

If FF is no (off); proceed to Block 148; 
If FF is yes (on); proceed to Block 152; 

Block 148: Decision Point 

20 

[0136] Compare the calculated AER and SP Values: 

If actual AER =< SP, omit tonescale correction process 150; 
If actual AER > SP, execute tonescale correction process 156; 

25 

[0137] Alternatively, if the AER parameter is not available, but the backlight indicator (BL) is on, execute tonescale 
correction process 156. 

Block 152: Decision Point 

30 

[0138] Compare the calculated FER and SP Values: 

If actual FER =< SP, omit tonescale correction process 154; 
If actual FER > SP, execute tonescale correction process 156; 

35 

Where SP = 2.8, which corresponds to a 1 .5 stop exposure ratio. 

[0139] The preferred value for AER and FER was derived from a review of numerous images with a wide range 
subject and background exposure ratios, together with learning from the optimization of fill-flash algorithms, as dis- 
closed by the present inventors in columns 50-58 of the aforementioned U.S. Patent Number 5,323,204. The SP value 

40 of 2.8 was selected because we found that it provides a reasonable balance between image quality enhancement and 
photofinishing throughput (images processed per unit of time). If the values for AER and FER are set to a lower JND 
value, the tonescale image defect location and correction processes will be invoked more frequently, which may lead 
to higher average image quality, but lower throughput due to extended image processing time. If the values for AER 
and FER are set to a higher JND value, the image defect location and correction processes will be invoked less fre- 

45 qUently, which may lead to lower average image quality, but higher throughput due to shortened image processing time. 

FIG. 6: Sharpness Defect Prediction Block 160 

Block 162: Meta Data Input List 

50 

[01 40] Camera M odel Data 
Ms,Mn,Mc,LFR,FL,T,CSF,LS 

Block 164: Processes 

55 

[0141] 

Calculate DSFEQ.14 
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Calculate Vds EQ.12 
Calculate Vdn EQ.12 
Calculate AM EQ.11 
Calculate MST EQ.13 

5 

Block 166: Decision Point 

[0142] Compare the display size factor (DSF) and SP: 

10 if actual DSF >= SP, proceed to Block 168; 

If actual DSF < SP, execute sharpness 180; 

Where SP = 0.65, which produces about 3 JNDs of quality loss for typical consumer imaging systems. 

is Block 168: Decision Point 

[01 43] Determine the flash fire state (FF): 

If FF is no (off); proceed to Block 170; 
^0 If FF is yes (on); omit sharpness correction process 178; 

Block 1 70: Decision Point 

[0144] Determine the lens focus range state (LFR): 

25 

If Ds is not outside LFR; proceed to Block 172; 

If Ds is outside LFR; execute sharpness correction process 180; 

Block 1 72: Decision Point 

30 

[0145] Compare the actual exposure time T and MST: 

If T is =< MST, proceed to Block 174; 

If T is > MST, execute sharpness correction process 180; 

35 

Block 1 74: Decision Point 

[0146] Measure the linear smear at image capture plane (LS): 

If LS/DSF is =< SP omit sharpness correction process 176; 
If LS/DSF is > SP, execute sharpness correction process 180; 

Where LS = 0.04 mm at image capture plane, which produces about 3 JNDs of quality loss with unit DSF. 
Linear smear may be determined in a variety of ways, including the following. The aforementioned U.S. Patent Number 
5,323,204 described in element 225 of Fig. 2, and the hardware description beginning in column 26, line 38, an accel- 
erometer-based camera shake detector capable of measuring linear smear. Alternatively, linear smear at the image 
plane can be acquired by taking temporally separated autofocusing system readings and comparing the linear trans- 
lation of the signal patterns (e.g. quantifying the displacement of matching signal profiles as a function of acquisition 
time). 

[0147] The image defect prediction processes 100,120,140,160, as described above, have shown the preferred 
embodiments, but it is to be understood that they may be advantageously applied, even if all of the data is not available. 
For example, the hierarchy is purposefully set so the most readily available data is required earliest in each process, 
and if data is missing, the decision of the last block containing data has priority. For example, if after the first decision 
point, the data suggests that there is a probability of an image defect, but no additional data is available, the defect 
correction process will be executed. This is acceptable, because a "mistake" based on omission of data, simply means 
that a defect correction may be occasionally applied when not needed. This will not reduce the image quality, but merely 
lead to a slight reduction in the throughput advantage produced by the preferred embodiment. 
[0148] The invention has been described with reference to a preferred embodiment. However, it will be appreciated 



25 



that variations and modifications can be effected by a person of ordinary skill in the art without departing from the scope 
of the invention. For instance, while the foregoing description discloses an image processing system incorporating a 
plurality of image defect location and correction processes that are each associated with correction of a different type 
of image defect, the invention is intended to encompass an image processing system incorporating a plurality of image 
5 defect location and correction processes that are each associated with a specific type of image defect, where the 
specific type may be a different type or the same defect. The latter situation may apply when two or more processes 
are applied in series to the same defect depending on severity levels, or different processes may be applied individually 
for different severity levels of the same type of defect or depending upon the source of the defect. 

10 

Claims 

1 . A method for processing a captured image with one or more correction processes selected from a plurality of such 
processes, each associated with correction of a specific type of image defect, in order to improve the appearance 

'5 of a viewed image generated from the captured image, said method comprising the steps of: 

collecting meta data related to image capture that is unique to each image that is captured, wherein the meta 
data is capable of indicating whether the specific types of image defects are likely to be present in the viewed 
image generated from the captured image; 
20 predicting the presence of the image defects based at least in part on the meta data, thereby generating 

process application criteria which indicate a level of image defect that if left untreated would reduce the per- 
ceived quality of the viewed image; 

selecting one or more correction processes to employ on the captured image based on the process application 
criteria; and 

25 applying the one or more selected correction processes to the captured image to generate the viewed image. 

2. The method as claimed in claim 1 wherein the meta data includes scene, camera or demographic data related to 
the image capture. 

30 3. The method as claimed in claim 1 wherein the step of predicting the presence of the image defects also predicts 
the severity of the defects and the strength of the corresponding correction process can be altered in response to 
the degree of severity. 

4. The method as claimed as in claim 1 wherein the meta data related to image capture is collected at the time of 
35 image capture. 

5. The method as claimed as in claim 1 wherein the meta data related to image capture is collected at a time other 
than the time of image capture. 

40 6. The method as claimed in claim 1 wherein the image defect is a noise defect and the meta data is selected from 
the group consisting of a lens exposure constant, standard (printing) reproduction magnification, non-standard 
(enlargement) magnification, diagonal dimension of a final display, ambient light level of the primary subject, ex- 
posure time, camera lens f-number, and main flash guide number 

45 7. The method as claimed in claim 1 wherein the image defect is a red-eye defect due to flash illumination of a subject 
and the meta data is selected from the group consisting of a use (on -off) of the flash illumination, illumination level 
of the primary subject, subject distance, flash-to-camera lens separation, focal length of camera lens, current 
reproduction (printing) magnification, diagonal dimension of final display, and preflash guide number. 

so 8. The method as claimed in claim 1 wherein the image defect is a tone scale defect and the meta data is selected 
from the group consisting of respective illumination levels of the subject and background, subject distance, back- 
ground distance, exposure time, camera lens f-number, use (on/off) of flash illumination, guide number of a main 
flash, lens exposure constant and ISO speed of a capture device. 

55 9. The method as claimed in claim 1 wherein the image defect is a sharpness defect and the meta data is selected 
from the group consisting an exposure time, standard reproduction (printing) magnification, non-standard (enlarge- 
ment) magnification, current reproduction (printing) magnification, camera lens focus range, camera lens focal 
length , camera shake factor and -linear smear. 
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10. The method as claimed In claim 1 further comprising the step of collecting meta data related to display parameters 
of the viewed image generated from each image that is captured, wherein said meta data is capable of indicating 
whether the specific types of image defects are likely to be present in the viewed image. 
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